Optimizing data center controls using neural networks

ABSTRACT

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for improving operational efficiency within a data center by modeling data center performance and predicting power usage efficiency. An example method receives a state input characterizing a current state of a data center. For each data center setting slate, the state input and the data center setting slate are processed through an ensemble of machine learning models. Each machine learning model is configured to receive and process the state input and the data center setting slate to generate an efficiency score that characterizes a predicted resource efficiency of the data center if the data center settings defined by the data center setting slate are adopted t. The method selects, based on the efficiency scores for the data center setting slates, new values for the data center settings.

CROSS-REFERENCE TO RELATED APPLICATION

This application is a continuation of and claims priority to U.S. patentapplication Ser. No. 15/410,547, filed on Jan. 19, 2017. The disclosureof the prior application is considered part of and is incorporated byreference in the disclosure of this application.

BACKGROUND

This specification relates to large-scale data center optimization.

A data center is a facility that holds computer servers for remotestorage, processing, or distribution of large amounts of data. Usingresources, e.g., energy, efficiently is a primary concern for datacenter operators.

SUMMARY

This specification describes technologies for data center optimization.These technologies generally involve methods and systems for applyingmachine learning algorithms to improve data center efficiency.

In general, one innovative aspect of the subject matter described inthis specification can be embodied in methods that include the actionsfor improving operational efficiency within a data center by modelingdata center performance and predicting power usage efficiency. Otherembodiments of this aspect include corresponding computer systems,apparatus, and computer programs recorded on one or more computerstorage devices, each configured to perform the actions of the methods.For a system of one or more computers to be configured to performparticular operations or actions means that the system has installed onit software, firmware, hardware, or a combination of them that inoperation cause the system to perform the operations or actions. For oneor more computer programs to be configured to perform particularoperations or actions means that the one or more programs includeinstructions that, when executed by data processing apparatus, cause theapparatus to perform the operations or actions.

The foregoing and other embodiments can each optionally include one ormore of the following features, alone or in combination. In particular,one embodiment includes all the following features in combination. Anexample method includes: receiving a state input characterizing acurrent state of a data center; for each data center setting slate in afirst set of data center setting slates that each define a respectivecombination of possible data center settings that affect a resourceefficiency of the data center: processing the state input and the datacenter setting slate through each machine learning model in an ensembleof machine learning models, wherein each machine learning model in theensemble is configured to: receive the state input and the data centersetting slate, and process the state input and the data center settingslate to generate an efficiency score that characterizes a predictedresource efficiency of the data center if the data center settingsdefined by the data center setting slate are adopted in response toreceiving the state input; and selecting, based on the efficiency scoresfor the data center settings. One innovative aspect of the subjectmatter described in this specification can be embodied in a systemcomprising one or more computers and one or more storage devices storinginstructions that are operable, when executed by the one or morecomputers to cause the one or more computers to perform operations thatimplement the example method. One innovative aspect of the subjectmatter described in this specification can be embodied in one or morenon-transitory computer-readable mediums comprising instructions storedthereon that are executable by a processing device and upon suchexecution cause the processing device to perform operations thatimplement the example method.

These and other embodiments can optionally include one or more of thefollowing features. The efficiency score can be a predicted long-termpower usage effectiveness (PUE) of the data center if the data centersettings defined by the data center setting slate are adopted inresponse to receiving the state input. The machine learning models canbe configured to generate the predicted long-term PUE of the data centerthrough training the machine learning models on training data thatincludes a plurality of training state inputs and, for each trainingstate input, a target PUE that was the PUE of the data center apredetermined time after the data center was in the state characterizedby the training state input. Raw training state inputs can be receivedand preprocessed to generate the training state inputs. Thepredetermined time may be greater than thirty minutes. The predeterminedtime may be an hour. Selecting new values for the data center settingsmay include: determining, for each data center setting slate, anaggregate resource efficiency score from the efficiency scores generatedfor the data center setting slate by the ensemble of models;determining, for each data center setting slate, a measure of variationof the efficiency scores generated for the data center setting slate bythe ensemble of models; ranking the data center setting slates based onthe aggregate resource efficiency scores and the measures of variation;and selecting the combination of possible data center settings definedby a highest-ranked data center setting slates as the new values for thedata center settings. The aggregate resource efficiency score is ameasure of central tendency of the efficiency scores generated by theensemble of models. Ranking the data center setting slates may compriseranking the data center slates in an exploitative manner by penalizingdata center setting slates that have higher measures of variation morein the ranking than data center setting slates that have lower measuresof variation. Ranking the data center setting slates may compriseranking the data center slates in an explorative manner by promotingdata center setting slates that have higher measures of variation morein the ranking than data center setting slates that have lower measuresof variation. Data may identify a second set of data center settingslates and the first set of data center setting slates may be generatedby removing from the second set of data center slates any data centersetting slate that would, if the data center settings defined by thedata center setting slate were adopted in response to receiving thestate input, result in any of one or more operating constraints for thedata center being violated. Generating the first set of data centersetting slates may comprise, for each operating constraint: for eachdata center setting slate in the second set of data center settingslates: processing the state input and the data center setting slatethrough one or more constraint machine learning model that are specificto the operation constraint, wherein each constraint machine learningmodel is configured to: receive the state input and the data centersetting slate, and process the state input and the data center settingslate to generate a constraint score that characterizes a predictedvalue of an operating property of the data center settings defined bythe data center setting slate are adopted in response to receiving thestate input; generating a final constraint score for each data centersetting slate for the constraint scores generated by the one or moreconstraint machine learning models that are specific to the constraint;and removing from the second set any data center setting slates from thesecond set of data center setting slates based on the constraint scoresto generate the first set of data center setting slates. One of theoperating constraints may be a constraint on the temperature of the datacenter over a next hour and the operating property of the data centerthat corresponds to the operating constraint is the temperature of thedata center over the next hour. One of the operating constraints may bea constraint on the pressure of the data center over a next hour and theoperating property of the data center that corresponds to the operatingconstraint is the pressure of the data center over the next hour. Eachmachine learning model in the ensemble may have been trained on adifferent sample of training data from each other machine learning modelin the ensemble or has a different model architecture from each othermachine learning model in the ensemble. Data may be received thatidentifies a true value of the efficiency score for the data center at atime after the data center was in the current state and the currentstate input may be used, along with the new values for the data centersettings, and the true value of the efficiency score in re-training theensemble of machine learning models.

The subject matter described in this specification can be implemented inparticular embodiments so as to realize one or more of the followingadvantages. An example system uses machine learning model algorithms toimprove operational efficiency within data centers by modeling datacenter performance and predicting resource efficiency.

Data centers make optimization of operational efficiency difficultbecause they have complex interactions among multiple mechanical,electrical, and control systems. Data center equipment, the operation ofthat equipment, and the environment interact with each other in a numberof operating configurations and with nonlinear interdependencies.Additionally, each data center has a unique architecture andenvironment. Therefore, optimizing data center parameters for one datacenter may not be applicable for another datacenter. Machine learningmodels can be trained on different operating scenarios and parameterswithin data centers to produce an efficient and adaptive framework thatunderstands data center dynamics and optimizes efficiency within datacenters. By applying machine learning techniques to model complexdynamics, data center control systems implement actions that are focusedon long term efficiency. Machine learning models can be optimized fordifferent criteria including: lowest power usage, lowest water usage,lowest money spent on electricity, and most possible CPU loads that canbe put in a datacenter.

By being able to predict operational efficiency, engineers can plan newdata centers that they know are efficient with the minimum amount ofresources.

The details of one or more embodiments of the subject matter of thisspecification are set forth in the accompanying drawings and thedescription below. Other features, aspects, and advantages of thesubject matter will become apparent from the description, the drawings,and the claims.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is shows an example of the efficiency management system.

FIG. 2 is a flowchart of an example process for improving operationalefficiency within a data center by modeling data center performance andpredicting power usage efficiency.

FIG. 3 is a flowchart of an example process for training ensembles ofmodels to predict power usage efficiency.

Like reference numbers and designations in the various drawings indicatelike elements.

DETAILED DESCRIPTION

The specification generally describes an efficiency management systemthat provides optimization recommendations to improve efficiency withina data center.

In a data center, many possible combinations of hardware, e.g.,mechanical and electrical equipment, and software, e.g., controlstrategies and set points, contribute to data center efficiency. Forexample, one of the primary sources of energy use in the data centerenvironment is cooling. Data centers generate heat that must be removedto keep the servers running. Cooling is typically accomplished by largeindustrial equipment such as pumps, chillers, and cooling towers.

However, a simple change to a cold aisle temperature set point willproduce load variations in the cooling infrastructures of the datacenter, e.g., chillers, cooling towers, heat exchangers, and pumps.These load variations cause nonlinear changes in equipment efficiency.The number of possible operating configurations and various feedbackloops among data center equipment, equipment operation, and the datacenter environment make it difficult to optimize efficiency. Testingeach and every feature combination to maximize data center efficiency isunfeasible given time constraints, frequent fluctuations in data centersensor information and weather conditions, and the need to maintain astable data center environment. Traditional engineering formulas forpredictive modeling often produce large errors because they fail tocapture the complex interdependencies of systems in the data center.

FIG. 1 shows an example of an efficiency management system (100). Theefficiency management system (100) is an example of a system implementedas computer programs on one or more computers in one or more locations,in which the systems, components, and techniques described below can beimplemented.

The efficiency management system (100) receives state data (140)characterizing the current state of a data center (104) and providesupdated data center settings (120) to a control system (102) thatmanages the settings of the data center (104).

The efficiency management system (100) can take in, as input, state data(140) representing the current state of the data center (104). Thisstate data (140) can come from sensor readings of sensors in the datacenter (104) and operating scenarios within the data center (104). Thestate data may include data such as temperatures, power, pump speeds,and set points.

The efficiency management system (100) uses this data to determine datacenter settings (120) that should be changed in the data center (104) inorder to make the data center (104) more efficient.

Once the efficiency management system (100) determines the data centersettings (120) that will make the data center (104) more efficient, theefficiency management system (100) provides the updated data centersettings (120) to the control system (102). The control system (102)uses the updated data center settings (120) to set the data center (104)values. For example, if the efficiency management system (100)determines that an additional cooling tower should be turned on in thedata center (104), the efficiency management system (100) can eitherprovide the updated data center settings (120) to a user who updates thesettings or to the control system (102), which automatically adopts thesettings without user interaction. The control system (102) can send thesignal to the data center to increase the number of cooling towers thatare powered on and functioning in the data center (104).

The efficiency management system (100) can train an ensemble of machinelearning models (132A-132N) using a model training subsystem (160) topredict the resource efficiency of the data center (104) if particulardata center settings are adopted. In some cases, the efficiencymanagement system (100) can train a single machine learning model topredict the resource efficiency of the data center if particular datacenter settings are adopted.

In particular, each machine learning model (132A-132N) in the ensembleis configured through training to receive a state input characterizingthe current state of the data center (104) and a data center settingslate that defines a combination of possible data center settings and toprocess the state input and the data center setting slate to generate anefficiency score that characterizes a predicted resource efficiency ofthe data center if the data center settings defined by the data centersetting slate are adopted.

In some implementations, the efficiency score represents a predictedpower usage effectiveness (PUE) of the data center if the settings of aparticular slate are adopted by the data center (104). PUE is defined asthe ratio of the total building energy usage to the informationtechnology energy usage.

In some implementations, the efficiency score represents a predictedwater usage of the data center if the settings of a particular slate areadopted by the data center (104). In other implementations, theefficiency score represents a predicted monetary amount spent onelectricity. In other implementations, the efficiency score represents apredicted load amount that can be put in a datacenter.

In some implementations, each machine learning model (132A-132N) is aneural network, e.g., a deep neural network, that the efficiencymanagement system (100) can train to produce an efficiency score.

Neural networks are machine learning models that employ one or morelayers of models to generate an output, e.g., one or moreclassifications, for a received input. Deep neural networks include oneor more hidden layers in addition to an output layer. The output of eachhidden layer is used as input to the next layer in the network, i.e.,the next hidden layer or the output layer. Each layer of the neuralnetwork generates an output from a received input in accordance withcurrent values of a respective set of parameters for the layer.

The model training subsystem (120) uses historical data from a datacenter (104) to create different datasets of sensor data from the datacenter. Each machine learning model (132A-132N) in the ensemble ofmachine learning models can be trained on one dataset of historicalsensor data.

The efficiency management system (100) can train additional ensembles ofconstraint machine learning models (112A-112N) using the model trainingsubsystem (160) to predict an operating property of the data center thatcorresponds to an operating constraint if the data center (104) adoptscertain data center settings (102).

As will be described in more detail below, if the efficiency managementsystem (100) determines that a constraint model predicts that the valueof a given data center setting will violate a constraint of the datacenter, the efficiency management system will discard the violatingsetting.

Each constraint model (112A-112N) is a machine learning model, e.g., adeep neural network, that is trained to predict certain values of anoperating property of the data center over a period of time if the datacenter adopts a given input setting. For example, the model trainingsubsystem (160) can train one constraint model to predict the futurewater temperature of the data center over the next hour given inputstate data (140) and potential settings. The model training subsystem120 can train another constraint model to predict the water pressureover the next hour given the state data (140) and potential settings.

A setting slate management subsystem (110) within the efficiencymanagement system (100) preprocesses the state data (140) and constructsa set of setting slates that represent data center setting values thatcan be set for various parts of the data center given the knownoperating conditions and the current state of the data center (104).Each setting slate defines a respective combination of possible datacenter settings that affect the efficiency of the data center (104).

For example, the efficiency management system (100) may determine themost resource efficient settings for a cooling system of the data center(104). The cooling system may have the following architecture: (1)servers heat up the air on the server floor; (2) the air is cycled andthe heat is transferred to the process water system; (3) the processwater system is cycled and connects to the condenser water system usinga heat sync; and (4) the condenser water system takes the heat from theprocess water system and transfers it to the outside air using coolingtowers or large fans.

To efficiently control the cooling system, the efficiency managementsystem (100) may construct different potential setting slates thatinclude various temperatures for the cooling tower set points, coolingtower bypass valve positions, cooling unit condenser water pump speeds,a number of cooling units running, and/or process water differentialpressure set points.

As an example, one setting slate may include the following values: 68degrees as the temperature for the cooling tower set points, 27 degreesas the cooling tower bypass valve position, 500 rpm as the cooling unitcondenser water pump speed, and 10 as the number of cooling unitsrunning.

Other examples of slate settings that impact efficiency of the datacenter (104) include: potential power usage across various parts of thedata center; certain temperature settings across the data center; agiven water pressure; specific fan or pump speeds; and a number and typeof the running data center equipment such as cooling towers and waterpumps.

During preprocessing, the setting slate management subsystem (110) canremove data with invalid power usage efficiency, replace missing datafor a given data setting with a mean value for that data setting, and/orremove a percentage of data settings. The setting slate managementsystem (110) discretizes all of the action dimensions and generates anexhaustive set of possible action combinations. For any continuousaction dimensions, the system converts the action into a discrete set ofpossible values. For example, if one of the action dimensions is a valvethat has a value from 0.0 to 1.0, the system may discretize the valuesinto the set [0.0, 0.05, 0.1, 0.15, . . . , 1.0]. The system maydiscretize for every dimension and the full action set is every possiblecombination of the values. The system then removes all actions thatviolate the constraint models.

The setting slate management subsystem (110) sends the constructed setof setting slates and the current state of the data center (104) to theconstraint models (112A-112N). The setting slate management subsystemthen determines whether certain data center setting slates, if chosen bythe system, are predicted to result in violations of operatingconstraints for the data center. The setting slate management subsystem(100) removes any data center setting slates from the set of settingslates that are predicted to violate the constraints of the data center.

The efficiency management system (100) sends the updated set of settingslates and the state data (140) to the ensemble of machine learningmodels (132A-132N) that use the state data and the setting slates togenerate efficiency scores as output.

Since each machine learning model in the ensemble of models is trainedon a different dataset than the other models, each model has thepotential to provide a different predicted PUE output when all themachine learning models in the ensemble are run with the same datacenter setting as input. Additionally or alternatively each machinelearning model may have a different architecture which could also causeeach model to potentially provide a different predicted PUE output.

The efficiency management system (100) can choose data center settingvalues that focus on long-term efficiency of the data center. Forexample, some data center setting values provide long-term power usageefficiency for the data center, e.g., ensuring that the power usage inthe data center is efficient for a long predetermined time after thedata center was in the state characterized by the state input. Long-termpower usage efficiency may be for time durations of thirty minutes, onehour, or longer from the time the data center was in the input statewhereas short term power usage efficiency focuses on a short time afterthe data center was in the input state, e.g., immediately after or fiveseconds after, the data center was in the input state.

The system can optimize the machine learning models for long termefficiency so that the models can make predictions based on the dynamicsof the data center and are less likely to provide recommendations forslate settings that yield good results in the short term, but are badfor efficiency over the long term. For example, the system can predictPUE over the next day, assuming that optimal actions will continue to betaken every hour. The system can then take actions that it knows willlead to the best PUE over the whole day, even if the PUE for a givenhour is worse than the previous hour.

The efficiency management system (100) determines the final efficiencyscore for a given setting slate based on the efficiency scores of eachmachine learning model in the ensemble of models for a given settingslate to produce one overall efficiency score per setting slate.

The efficiency management system (100) then either recommends or selectsnew values for the data center settings based on the efficiency scoresassigned to each slate from the machine learning models (132A-132N). Theefficiency management system can send the recommendations to a datacenter operator to be implemented, e.g., by being presented to the datacenter operator on a user computer, or set automatically without needingto be sent to a data center operator.

FIG. 2 is a flowchart of an example process 200 for selecting a settingslate for a data center using an ensemble of machine learning models.For convenience, the process 200 will be described as being performed bya system of one or more computers, located in one or more locations, andprogrammed appropriately in accordance with this specification. Forexample, an efficiency management system, e.g., the efficiencymanagement system 100 of FIG. 1, appropriately programmed, can performthe process 200.

The system receives a state input representing a current state of a datacenter (210) and generates a set of setting slates. As disclosed above,each setting slate defines a respective combination of possible datacenter settings or actions that affect the efficiency of the data centergiven the current state of the data center. The system sends the set ofsetting slates to the constraint models to determine whether any of thesetting slates are predicted to violate constraints of the data center.The system then removes data center setting slates from the set ofsetting slates that are predicted to violate the constraints.

For each data center setting slate in the set of data center settingslates, the system processes the state input and the data center settingslate through each machine learning model in an ensemble of machinelearning models to generate an efficiency score for each machinelearning model (220). Since each machine learning model is trained on adifferent dataset, each machine learning model has the potential toprovide a different efficiency score for a setting slate than the othermachine learning models.

The system then selects an overall efficiency score for the settingslate based on the efficiency scores generated by each machine learningmodel for the setting slate.

Each model in the ensemble of machine learning models provides apredicted PUE for each data center setting slate. To determine the finalpredicted PUE for a setting slate, the system can use various methodsincluding: choosing the predicted PUE with the lowest mean or using apessimistic or optimistic algorithm to determine a final predicted PUE.

To choose the predicted PUE with the lowest mean, the system determinesthe mean value of the predicted PUE scores output by the ensemble ofmodels. The efficiency management system then chooses the predicted PUEwith the lowest mean.

In some other implementations, for each data center setting slate, thesystem determines an aggregate efficiency score from the efficiencyscores generated for the data center setting slate by the ensemble ofmodels, e.g., a mean or other measure of central tendency of theefficiency scores. The system then determines a measure of variation,e.g., a standard deviation, of the efficiency scores generated for thedata center setting slate by the ensemble of models. The data centersetting slates are then ranked based on the aggregated efficiency scoresand the measures of variation. The system selects the combination ofpossible data center settings that are defined by a highest-ranked datacenter setting slate.

For example, in some instances, the system determines the ranking usinga pessimistic algorithm that is used for exploitation or an optimisticalgorithm that is used for exploration.

In exploitation mode, the system ranks the data center slates in anexploitative manner by penalizing data center setting slates that havehigher measures of variation more in the ranking than data centersetting slates that have lower measures of variation. For example, thesystem can generate the ranking by, for each setting slate, determininga final predicted PUE by adding λ₁ multiplied by the standard deviationof the PUE value to the mean PUE, with λ₁ being a predetermined constantvalue and then ranking the setting slates by their final predicted PUEs.

In exploration mode, the system ranks the data center slates in anexplorative manner by promoting data center setting slates that havehigher measures of variation more in the ranking than data centersetting slates that have lower measures of variation. For example, thesystem can generate the ranking by, for each setting slate, determininga final predicted PUE by subtracting λ₁ multiplied by the standarddeviation of the target value from the mean PUE value, with λ₁ being apredetermined constant value and then ranking the setting slates bytheir final predicted PUEs.

The models can be fast learning models that have memory architecturesand are taught to remember bad actions, including actions that make thedata center less efficient.

The system selects, based on the efficiency scores for the data centersetting slates in the set of data center setting slates, new values forthe data center settings (230). The system then recommends the selecteddata setting values to the control system to update the settings of thedata center. In some implementations, the system generatesrecommendations on a per time step basis and displays them to thecontrol system so that the data center operators can take action andupdate the data center settings. In other implementations, the systemuses the data settings to automatically update the data center withouthuman interaction.

FIG. 3 is a flowchart of an example process 300 for training ensemblesof models to predict power usage efficiency for a given data centersetting slate. For convenience, the process 300 will be described asbeing performed by a system of one or more computers, located in one ormore locations, and programmed appropriately in accordance with thisspecification. For example, an efficiency management system, e.g., theefficiency management system 100 of FIG. 1, appropriately programmed,can perform the process 300.

As disclosed above, the system (100) uses historical data from datacenters as well as other information to train the ensembles of machinelearning models to determine efficiency scores (310). The historicaldata can come from past sensor readings from sensors across the datacenter. Example data includes: power usage across various parts of adata center such as the server floor, cooling system, networking room,and individual fans; temperature sensors across the data center such asin the water cooling system, on the server floor, and in the chiller;water and/or air speed in various parts of the data center such as thedifferential pressure of the server floor air and the differentialpressure in the water cooling system; fan and/or pump speeds such as thecooling tower fan speeds and the process water pump speed; weather,i.e., the outside air temperature, humidity, and/or air pressure, andforecasts of future weather; and equipment status, i.e., whether thechiller is running, how many cooling towers are on and how many pumpsare running.

The system then samples the data with replacement to create differentdatasets for each machine learning model in the ensemble, e.g., 10(320).

The system trains several machine learning models to produce an ensembleof models that each predict an average future power usage effectiveness(PUE) given a potential data center setting slate as model input (330).The system trains each machine learning model on one of the createddatasets. The models use supervised learning algorithms to analyzetraining data and produce inferred functions. The models contain areinforcement learning loop that provides delayed feedback that uses areward signal. In this loop, models map from state to action andevaluate the tradeoff between exploration and exploitation actions.

During training, the model training subsystem (120) uses techniques ofdeep learning including batch normalization, dropout, rectified linearunits, early stopping, and other techniques to train the models. Thesystem can train models using bootstrapping to obtain estimates of themean and variance for each prediction which allows the models toincorporate uncertainty into their predictions. By incorporatinguncertainty, models operate in a more stable manner.

The system also trains constraint models to predict and/or constraincertain data center settings (340). Trained models can be transferredamong data centers so that new data centers have models that are trainedon real-time data so that the data centers have optimized settings fordata center efficiency.

At inference time, the system uses the ensemble of models and theconstrain models to recommend a setting slate for the data center (350).As described above, the system constructs a set of setting slates withinthe known operating conditions of the data center. The system also looksat the current state of the data center using the data center sensors.The set of setting slates and the current state of the data center arerun through the constraint models to remove any settings that arepredicted to violate constraints of the data center. The reduced set ofsetting slates and the current state of the data center are run throughthe ensemble of models. These models can be trained to predict powerusage efficiency. The system selects the setting slate with the lowestpredicted power usage efficiency. The system can then either send thesetting slate as a recommendation to a data center operator or use thesetting slate to directly control the data center settings or equipment.The system may generate the recommendations on a per time-step basis,e.g., hourly. Long term planning becomes more important for the systemthe more often recommendations are provided.

Optionally, the system can re-train the models using true values of theefficiency scores for the data center after the setting slatesrecommended by the system have been adopted. That is, the system canreceive data identifying a true value of the efficiency score for thedata center at a time after the data center was in a current state anduse the current state input characterizing the current state, the newvalues for the data center settings that were recommended by the system,and the true value of the efficiency score in re-training the ensembleof machine learning models, e.g., after a threshold number of truevalues have been received or in response to determining that theperformance of the machine learning models has deteriorated.

Embodiments of the subject matter and the functional operationsdescribed in this specification can be implemented in digital electroniccircuitry, in tangibly-embodied computer software or firmware, incomputer hardware, including the structures disclosed in thisspecification and their structural equivalents, or in combinations ofone or more of them. Embodiments of the subject matter described in thisspecification can be implemented as one or more computer programs, i.e.,one or more modules of computer program instructions encoded on atangible non-transitory storage medium for execution by, or to controlthe operation of, data processing apparatus. The computer storage mediumcan be a machine-readable storage device, a machine-readable storagesubstrate, a random or serial access memory device, or a combination ofone or more of them. Alternatively or in addition, the programinstructions can be encoded on an artificially-generated propagatedsignal, e.g., a machine-generated electrical, optical, orelectromagnetic signal, that is generated to encode information fortransmission to suitable receiver apparatus for execution by a dataprocessing apparatus. The term “data processing apparatus” refers todata processing hardware and encompasses all kinds of apparatus,devices, and machines for processing data, including by way of example aprogrammable processor, a computer, or multiple processors or computers.The apparatus can also be, or further include, special purpose logiccircuitry, e.g., an FPGA (field programmable gate array) or an ASIC(application-specific integrated circuit). The apparatus can optionallyinclude, in addition to hardware, code that creates an executionenvironment for computer programs, e.g., code that constitutes processorfirmware, a protocol stack, a database management system, an operatingsystem, or a combination of one or more of them.

A computer program, which may also be referred to or described as aprogram, software, a software application, an app, a module, a softwaremodule, a script, or code, can be written in any form of programminglanguage, including compiled or interpreted languages, or declarative orprocedural languages; and it can be deployed in any form, including as astand-alone program or as a module, component, subroutine, or other unitsuitable for use in a computing environment. A program may, but neednot, correspond to a file in a file system. A program can be stored in aportion of a file that holds other programs or data, e.g., one or morescripts stored in a markup language document, in a single file dedicatedto the program in question, or in multiple coordinated files, e.g.,files that store one or more modules, sub-programs, or portions of code.A computer program can be deployed to be executed on one computer or onmultiple computers that are located at one site or distributed acrossmultiple sites and interconnected by a data communication network.

The processes and logic flows described in this specification can beperformed by one or more programmable computers executing one or morecomputer programs to perform functions by operating on input data andgenerating output. The processes and logic flows can also be performedby special purpose logic circuitry, e.g., an FPGA or an ASIC, or by acombination of special purpose logic circuitry and one or moreprogrammed computers.

Computers suitable for the execution of a computer program can be basedon general or special purpose microprocessors or both, or any other kindof central processing unit. Generally, a central processing unit willreceive instructions and data from a read-only memory or a random accessmemory or both. The essential elements of a computer are a centralprocessing unit for performing or executing instructions and one or morememory devices for storing instructions and data. The central processingunit and the memory can be supplemented by, or incorporated in, specialpurpose logic circuitry. Generally, a computer will also include, or beoperatively coupled to receive data from or transfer data to, or both,one or more mass storage devices for storing data, e.g., magnetic,magneto-optical disks, or optical disks. However, a computer need nothave such devices. Moreover, a computer can be embedded in anotherdevice, e.g., a mobile telephone, a personal digital assistant (PDA), amobile audio or video player, a game console, a Global PositioningSystem (GPS) receiver, or a portable storage device, e.g., a universalserial bus (USB) flash drive, to name just a few.

Computer-readable media suitable for storing computer programinstructions and data include all forms of non-volatile memory, mediaand memory devices, including by way of example semiconductor memorydevices, e.g., EPROM, EEPROM, and flash memory devices; magnetic disks,e.g., internal hard disks or removable disks; magneto-optical disks; andCD-ROM and DVD-ROM disks.

To provide for interaction with a user, embodiments of the subjectmatter described in this specification can be implemented on a computerhaving a display device, e.g., a CRT (cathode ray tube) or LCD (liquidcrystal display) monitor, for displaying information to the user and akeyboard and a pointing device, e.g., a mouse or a trackball, by whichthe user can provide input to the computer. Other kinds of devices canbe used to provide for interaction with a user as well; for example,feedback provided to the user can be any form of sensory feedback, e.g.,visual feedback, auditory feedback, or tactile feedback; and input fromthe user can be received in any form, including acoustic, speech, ortactile input. In addition, a computer can interact with a user bysending documents to and receiving documents from a device that is usedby the user; for example, by sending web pages to a web browser on auser's device in response to requests received from the web browser.Also, a computer can interact with a user by sending text messages orother forms of message to a personal device, e.g., a smartphone, runninga messaging application, and receiving responsive messages from the userin return.

Embodiments of the subject matter described in this specification can beimplemented in a computing system that includes a back-end component,e.g., as a data server, or that includes a middleware component, e.g.,an application server, or that includes a front-end component, e.g., aclient computer having a graphical user interface, a web browser, or anapp through which a user can interact with an implementation of thesubject matter described in this specification, or any combination ofone or more such back-end, middleware, or front-end components. Thecomponents of the system can be interconnected by any form or medium ofdigital data communication, e.g., a communication network. Examples ofcommunication networks include a local area network (LAN) and a widearea network (WAN), e.g., the Internet.

The computing system can include clients and servers. A client andserver are generally remote from each other and typically interactthrough a communication network. The relationship of client and serverarises by virtue of computer programs running on the respectivecomputers and having a client-server relationship to each other. In someembodiments, a server transmits data, e.g., an HTML page, to a userdevice, e.g., for purposes of displaying data to and receiving userinput from a user interacting with the device, which acts as a client.Data generated at the user device, e.g., a result of the userinteraction, can be received at the server from the device.

While this specification contains many specific implementation details,these should not be construed as limitations on the scope of anyinvention or on the scope of what may be claimed, but rather asdescriptions of features that may be specific to particular embodimentsof particular inventions. Certain features that are described in thisspecification in the context of separate embodiments can also beimplemented in combination in a single embodiment. Conversely, variousfeatures that are described in the context of a single embodiment canalso be implemented in multiple embodiments separately or in anysuitable subcombination. Moreover, although features may be describedabove as acting in certain combinations and even initially be claimed assuch, one or more features from a claimed combination can in some casesbe excised from the combination, and the claimed combination may bedirected to a subcombination or variation of a sub combination.

Similarly, while operations are depicted in the drawings in a particularorder, this should not be understood as requiring that such operationsbe performed in the particular order shown or in sequential order, orthat all illustrated operations be performed, to achieve desirableresults. In certain circumstances, multitasking and parallel processingmay be advantageous. Moreover, the separation of various system modulesand components in the embodiments described above should not beunderstood as requiring such separation in all embodiments, and itshould be understood that the described program components and systemscan generally be integrated together in a single software product orpackaged into multiple software products.

Particular embodiments of the subject matter have been described. Otherembodiments are within the scope of the following claims. For example,the actions recited in the claims can be performed in a different orderand still achieve desirable results. As one example, the processesdepicted in the accompanying figures do not necessarily require theparticular order shown, or sequential order, to achieve desirableresults. In some cases, multitasking and parallel processing may beadvantageous.

What is claimed is:
 1. (canceled)
 2. A method comprising: receiving astate input characterizing a state of a facility and data defining aplurality of facility setting slates, each slate defining a respectivecombination of facility settings for the facility; for each facilityslate in the plurality of facility setting slates and for a firstoperating constraint of a plurality of operating constraints for thefacility, processing the state input and the facility slate through oneor more first machine learning models that are each specific to thefirst operating constraint to generate a first constraint score, whereinfacility the first constraint score characterizes a predicted value ofan operating property of the facility if facility settings defined bythe facility slate are adopted in response to receiving the state input;generating a filtered plurality of setting slates, comprising filteringslates from the plurality of facility setting slates having a respectivefirst constraint score that does not satisfy a predetermined thresholdvalue for the first operating constraint; processing each slate of thefiltered plurality of facility setting slates through one or more secondmachine learning models to generate an efficiency score for the slate,the efficiency score characterizing facility if the facility settingsdefined by the facility setting slate are adopted in response toreceiving the state input; and selecting, based on the respectiveefficiency scores for the facility setting slates in the second set offacility setting slates, new settings for the facility.
 3. The method ofclaim 2, wherein processing the state input and the slate through theone or more first machine learning models comprises: processing thestate input and the slate through each of the one or more first machinelearning models to obtain a respective candidate constraint score, therespective candidate constraint score characterizing a predicted valueof the operating property of the facility according to the first machinelearning model; and generating the respective first constraint scorebased on an average of the respective candidate constraint scores. 4.The method of claim 2, wherein processing each slate of the filteredplurality of facility settings through the one or more second machinelearning models to generate the efficiency score for the slatecomprises: processing the slate through each of the one or more secondmachine learning models to generate a respective candidate efficiencyscore, the respective candidate efficiency score characterizing apredicted resource efficiency of the facility according to the secondmachine learning model; and generating the efficiency score for theslate based on an average of the respective candidate efficiency scores.5. The method of claim 4, wherein selecting the new settings based onthe respective final efficiency scores comprises: ranking the facilitysetting slates based on the efficiency scores; and selecting settingsfrom a highest-ranked slate as the new settings for the facility.
 6. Themethod of claim 4, wherein each second machine learning model of the oneor more second machine learning models: has been trained on a differentsample of training data from each other second machine learning model ofthe one or more second machine learning models, or has a different modelarchitecture from each other second machine learning model of the one ormore second machine learning models.
 7. The method of claim 2, furthercomprising: for each slate in the plurality of facility setting slatesand for a second operating constraint of the plurality of operatingconstraints for the facility that is different than the first operatingconstraint, processing the state input and the slate through one or moremachine learning models specific to the second operating constraint togenerate a respective second constraint score; and filtering slates fromthe plurality of facility setting slates having a respective secondconstraint score that does not satisfy a predetermined threshold valuefor the second operating constraint.
 8. The method of claim 2, furthercomprising: receiving a second state input characterizing a state of asecond facility different from the facility; receiving data defining asecond plurality of facility setting slates for the second facility; foreach slate in the second plurality of facility setting slates and for asecond operating constraint of the plurality of operating constraints,processing the state input and the slate through the one or more firstmachine learning model specific to the first operating constraint togenerate a respective first constraint score for the slate; andfiltering slates from the second plurality of facility setting slateshaving a respective first constraint score that does not satisfy thepredetermined threshold value for the first operating constraint.
 9. Themethod of claim 2, wherein the respective efficiency score is apredicted long-term power usage effectiveness of the facility if thefacility settings defined by the slate are adopted in response toreceiving the state input.
 10. A system comprising: one or morecomputers and one or more storage devices on which are storedinstructions that are operable, when executed by the one or morecomputers, to cause the one or more computers to perform operationscomprising: receiving a state input characterizing a state of a facilityand data defining a plurality of facility setting slates, each slatedefining a respective combination of facility settings for the facility;for each slate in the plurality of facility setting slates and for afirst operating constraint of a plurality of operating constraints forthe facility, processing the state input and the slate through one ormore first machine learning models that are each specific to the firstoperating constraint to generate a first constraint score, wherein thefirst constraint score characterizes a predicted value of an operatingproperty of the facility if settings defined by the slate are adopted inresponse to receiving the state input; generating a filtered pluralityof setting slates, comprising filtering slates from the plurality offacility setting slates having a respective first constraint score thatdoes not satisfy a predetermined threshold value for the first operatingconstraint; processing each slate of the filtered plurality of facilitysetting slates through one or more second machine learning models togenerate an efficiency score for the slate, the efficiency scorecharacterizing a predicted resource efficiency of the facility if thefacility settings defined by the facility setting slate are adopted inresponse to receiving the state input; and selecting, based on therespective efficiency scores for the facility setting slates in thesecond set of facility setting slates, new settings for the facility.11. The system of claim 10, wherein processing the state input and theslate through the one or more first machine learning models comprises:processing the state input and the slate through each of the one or morefirst machine learning models to obtain a respective candidateconstraint score, the respective candidate constraint scorecharacterizing a predicted value of the operating property of thefacility according to the first machine learning model; and generatingthe respective first constraint score based on an average of therespective candidate constraint scores.
 12. The system of claim 10,wherein processing each slate of the filtered plurality of facilitysettings through the one or more second machine learning models togenerate the efficiency score for the slate comprises: processing theslate through each of the one or more second machine learning models togenerate a respective candidate efficiency score, the respectivecandidate efficiency score characterizing a predicted resourceefficiency of the facility according to the second machine learningmodel; and generating the efficiency score for the slate based on anaverage of the respective candidate efficiency scores.
 13. The system ofclaim 12, wherein selecting the new settings based on the respectivefinal efficiency scores comprises: ranking the facility setting slatesbased on the efficiency scores; and selecting settings from ahighest-ranked slate as the new settings for the facility.
 14. Thesystem of claim 12, wherein each second machine learning model of theone or more second machine learning models: has been trained on adifferent sample of training data from each other second machinelearning model of the one or more second machine learning models, or hasa different model architecture from each other second machine learningmodel of the one or more second machine learning models.
 15. The systemof claim 10, wherein the operations further comprise: for each slate inthe plurality of facility setting slates and for a second operatingconstraint of the plurality of operating constraints for the facilitythat is different than the first operating constraint, processing thestate input and the slate through one or more machine learning modelsspecific to the second operating constraint to generate a respectivesecond constraint score; and filtering slates from the plurality offacility setting slates having a respective second constraint score thatdoes not satisfy a predetermined threshold value for the secondoperating constraint.
 16. The system of claim 10, wherein the operationsfurther comprise: receiving a second state input characterizing a stateof a second facility different from the facility; receiving datadefining a second plurality of facility setting slates for the secondfacility; for each slate in the second plurality of facility settingslates and for a second operating constraint of the plurality ofoperating constraints, processing the state input and the slate throughthe one or more first machine learning model specific to the firstoperating constraint to generate a respective first constraint score forthe slate; and filtering slates from the second plurality of facilitysetting slates having a respective first constraint score that does notsatisfy the predetermined threshold value for the first operatingconstraint.
 17. The system of claim 10, wherein the respectiveefficiency score is a predicted long-term power usage effectiveness ofthe facility if the facility settings defined by the slate are adoptedin response to receiving the state input.
 18. One or more non-transitorycomputer-readable storage media encoded with instructions that, whenexecuted by one or more computers, cause the one or more computers toperform operations comprising: receiving a state input characterizing astate of a facility and data defining a plurality of facility settingslates, each slate defining a respective combination of facilitysettings for the facility; for each slate in the plurality of facilitysetting slates and for a first operating constraint of a plurality ofoperating constraints for the facility, processing the state input andthe slate through one or more first machine learning models that areeach specific to the first operating constraint to generate a firstconstraint score, wherein the first constraint score characterizes apredicted value of an operating property of the facility if settingsdefined by the slate are adopted in response to receiving the stateinput; generating a filtered plurality of setting slates, comprisingfiltering slates from the plurality of facility setting slates having arespective first constraint score that does not satisfy a predeterminedthreshold value for the first operating constraint; processing eachslate of the filtered plurality of facility setting slates through oneor more second machine learning models to generate an efficiency scorefor the slate, the efficiency score characterizing a predicted resourceefficiency of the facility if the facility settings defined by thefacility setting slate are adopted in response to receiving the stateinput; and selecting, based on the respective efficiency scores for thefacility setting slates in the second set of facility setting slates,new settings for the facility.
 19. The computer-readable media of claim18, wherein processing the state input and the slate through the one ormore first machine learning models comprises: processing the state inputand the slate through each of the one or more first machine learningmodels to obtain a respective candidate constraint score, the respectivecandidate constraint score characterizing a predicted value of theoperating property of the facility according to the first machinelearning model; and generating the respective first constraint scorebased on an average of the respective candidate constraint scores. 20.The computer-readable media of claim 18, wherein processing each slateof the filtered plurality of facility settings through the one or moresecond machine learning models to generate the efficiency score for theslate comprises: processing the slate through each of the one or moresecond machine learning models to generate a respective candidateefficiency score, the respective candidate efficiency scorecharacterizing a predicted resource efficiency of the facility accordingto the second machine learning model; and generating the efficiencyscore for the slate based on an average of the respective candidateefficiency scores.
 21. The computer-readable media of claim 18, whereinthe operations further comprise: for each slate in the plurality offacility setting slates and for a second operating constraint of theplurality of operating constraints for the facility that is differentthan the first operating constraint, processing the state input and theslate through one or more machine learning models specific to the secondoperating constraint to generate a respective second constraint score;and filtering slates from the plurality of facility setting slateshaving a respective second constraint score that does not satisfy apredetermined threshold value for the second operating constraint.